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ABSTRACT 

One of the main challenges for future 21 cm observations is to remove foregrounds which are several 
orders of magnitude more intense than the HI signal. We propose a new technique for removing 
foregrounds of the redshifted 21 cm observations. We consider multi-frequency interferometer obser- 
vations. We assume that the 21 cm signals in different frequency channels are uncorrelated and the 
foreground signals change slowly as a function of frequency. When we add the visibilities of all chan- 
nels, the foreground signals increase roughly by a factor of ~ N because they are highly correlated. 
However, the 21 cm signals increase by a factor of ~ \J~N because the signals in different channels 
contribute randomly. This enables us to obtain an accurate shape of the foreground angular power 
spectrum. Then, we obtain the 21-cm power spectrum by subtracting the foreground power spectrum 
obtained this way. We describe how to obtain the average power spectrum of the 21 cm signal. 
Subject headings: cosmology: observations — cosmology: large-scale structure of universe — ISM: 
general — turbulence 



1. INTRODUCTION 

A promising emerging field in cosmology is the pro- 
posed use of 21-cm emission from neutral hydrogen to 
trace the evolution of structure in the universe from red- 
shift to 50. At redshifts up to about 3, by mapping the 
three-dimensional intensity field with 10 arc-minute reso- 
lution, it may be possible to precisely measure the expan- 
sion history throughou t the transition from deceleration 
to acceleration dChang et all 120081: iWvithe et alj |2008t 
lLoeb fc Wvi"thil2008HMorales fc Wvithell2010D . For red- 
shifts near 10 several instruments are under construction 
to search for 21-cm emission from the Epoch of Rcioniza- 
tion (EoR;lMorales k Hewittll2004HZaroubi fc Silldl2005t 
iBacker et al.ll2007p . Some authors even suggest that ob- 
servations at redshifts well above 10 may provide very 
sharp tests of th e world model and the co mposition of 
the cosmic fluid (|Loeb fc Zaldarriaga 20Q3)- 

Most of the observational effort so far has focused on 
searching for the EoR signal. The first goal of current 
EoR programs is to pin down the redshift of reionization. 
Later measurements will address questions such as the 
following: What were the first sources of ionizing radia- 
tion? What is the power spectrum of the HI structure, 
and how did it evolve? The first generation of 21-cm 
EoR observatories may begin to attack this last question 
by observing the emission of the neutral hydrogen itself. 
The interpretation of the data in terms of pinning down 
the UV sources is likely to be much more difficult. 

Even at the end of the EoR, there is still enough HI 
(a few percent of all the hydrogen) in the form of self- 
shielded clumps to be visible at a brightness tempera- 
ture contrast of AT ~ 1 mK. We have some informa- 
tion about these structures since they are seen via UV 
absorption as Damped Lyman Alpha regions, or DLA's 
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(jWolfe et al.l 12005). Observation of 21-cm fluctuations 
from such systems provides a tool for understanding 
structure formation in the post-reionization era. These 
HI regions presumably trace the dark matter and hence 
can be used to study the evolution of the matter power 
spectrum. Measurement of the cosmic power spectrum as 
a function of redshift over the cosmic volume 0.5 < z < 6 
could provide cosmic-variance limited determination of 
cosmological parameters such as the eq uation of state of 
the d ark energy and the neutrino mass (jLoeb fc Wyithd 
l2008h . 

Multiple lines of observational evidence now indicate 
that dark energy accounts for ~ 70% of the energy den- 
sity in the universe, but so far we have few clues as to the 
physics underlying this phenomenon. There are a host 
of dark energy models and these can be distinguished 
by their equations of state. A promising approach is 
to infer the equation of state from the distance-redshift 
relation in the redshift range of to about 2. In par- 
ticular, baryon- acoustic oscillations (BAO) can provide 
a standard ruler for distance determinations. The BAO 
'wiggles' in the matter power spectrum can be measured 
efficiently using 21-cm intensity mappin g (Chan g et al.l 
[20081: lAnsari et al.ll2008l l20Tll ISeo etallboion 

To achieve these scientific goals a wide variety of in- 
novative 21-cm telescope designs have been proposed, 
som e prototypes built, and some telescopes completed 
(see lMorales fc Wy ithc 2010L for a review of these instru- 
ments and their scientific potential). However, there are 
many development tasks needed to advance the state of 
the art of 21-cm cosmology, including calibration strate- 
gies, low-noise amplifier development, correlator design, 
examination of array layouts, radio-quiet site develop- 
ment, and foreground removal. 

One of the main difficulties of all 21-cm telescopes is 
to remove foregrounds which are several orders of mag- 
nitud e more intense than the HI signal (Morales et al. 
2006). The basic idea is that most foregrounds have 
a smooth power-law frequency dependence in contrast 
with the HI signal. The exception is Galactic radio 



Cho et al. 



recombination lines which occur at known frequencies 
and can be excised. The main foregrounds are the 
Galactic synchrotron radiation and extragalactic point 
sources. Other sources like free-free electron emission 
(i.e. Bremsstrahlung) are much less intense and have 
power-law spectra similar to the first two components. 

Other foreground removal t echniq ues have been pro- 
posed as well. Morales et al. (HOOG) review these tech- 
niques and show how to correct the errors t hey ca use in 
the recovered power spectrum. Liu et al. (2009]) show 
that performing foreground removal in Fourier space 
eliminates errors that arise from frequency-dependent 
beam patterns (mode mixing). Liu et al. (|2011f) have 
developed a technique for estimating the power spectrum 
in the presence of foregrounds without introducing noise 
or bias to the power spectrum. 

Simulations by Bowman, Morales and Hewitt (2009) 
for the Murchison Widefield Array remove the contribu- 
tion from bright identified sources and then in each RA- 
dec pixel fit and subtract a polynomial in v. This two- 
step strategy allows for the observation of the HI signal 
at the time of reionization (z ~ 8). Similar foreground 
removal simulations have been applied to candidate dark 
energy observatories; Ansari et al. 120111 show how fore- 
grounds might be separated from the HI signal based 
on their differing spectral signatures as measured by an 
interferometric array of dishes or cylinder telescopes. 

Below we discuss a different idea for removing fore- 
grounds which is based on using the power spectrum. 
The physical basis of the idea is that at different frequen- 
cies the fluctuations of redshifted HI are not correlated, 
as we sample completely different regions of HI, while 
the foreground fluctuations are correlated as they arise 
from the same galactic inhomogeneities, e.g . tur bulent 
fluctuations as discussed in Cho & Lazarian 2002; 2010. 
However, unlike Cho & Lazarian (2010) the suggested 
analysis does not require assumptions about the power 
spectrum of fluctuations. 

In §2 we discuss the technique that we propose, in §3 
the applicability of our technique, and in §4 we summa- 
rize our results. 

2. METHOD 

2.1. Assumptions 

We assume the following: 

1. The observations are made at N different frequency 
channels (yi, i=l,..., N) and data are collected in 
uv coordinates. We assume that 8v (= j/j+i — Vi) 
is constant. We denote the visibility at channel i 
as T Vi (u, v). 

2. Observations are made for a small patch on the 
sky. We use (9 X , 9 y ) to denote a position on the sky 
plane. We use either {k x ,k y ) or (u,v) to denote a 
position in Fourier space. Note that I = 2ir(u 2 + 
v 2 ) 1 ' 2 (jWhite et al.llT999h and Cf oc \T v% {u,v)\ 2 = 
0-/N sum )Y^i/2-K \Tvi(u,v)\ , where the summation 

is done for (l/2n) < vV + v 2 < (Z/2tt) + (A//2tt) 
and N sum is the number of modes used for the sum- 
mation. 

3. The 21-cm signals from different frequency chan- 
nels are uncorrelated. 
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Fig. 1. — Method. Left: Observation are done in N different fre- 
quency channels (y = v\, ...,i/jv). The foreground signals are sev- 
eral orders of magnitude stronger than the 21-cm signal (low-left 
panel). We assume that the 21-cm signals from different channels 
are uncorrelated. However, we assume the foreground signals from 
different channels are highly correlated. Right: When we add all 
T„. (u, v)'b, the foreground signals are enhanced by a factor of ~ N, 
while the 21-cm signal is enhanced by a factor of ~ y/W. In terms of 
angular spectrum, the foreground spectrum goes up by a factor of 
~ N 2 and the 21-cm spectrum goes up by ~ TV. The angular spec- 
trum of T avg (u, v) = £. T Vi (u, v)/N will be ~ c[ or + Cf lcm /N, 
where 'for' and '21-cm' stand for the foreground and the redshifted 
21-cm, respectively. Therefore, the spectrum of T aV g{u, v), C^ vg , 
should represent an accurate shape of the foreground spectrum. 
We can obtain the 21-cm spectrum by subtracting the foreground 
spectrum. 

4. The cross-correlation between the foreground and 
the 21-cm signals is negligible. This is not a trivial 
issue if the 21-cm signal is much weaker than the 
foreground one. We will come back to this in S}3] 

5. The noise level is lower than the 21-cm signal. See 
also discussions in $3] 

2.2. Basic idea 

We can obtain an accurate shape of the foreground 
angular power spectrum in the following way. Suppose 
that we have multi-channel observation data in uv coor- 
dinates (left panel of Fig. 1). We stack all the uv plane 
data into a single file (right panel of Fig. 1). That is, we 
calculate ^2 i=l T Vi (u,v). (If the observations are done 

in real space, we first obtain ^2 i= iT Ui (8 Xl d y ) and then 
we perform a Fourier transform.) In the stacked map, 
since the foreground signals in different channels are cor- 
related, they are enhanced by a factor of ~ N. However, 
the noise and the 21-cm signals are enhanced by a fac- 
tor of \/N because they contribute randomly. Therefore, 
the contrast between the foreground and the noise/2 1-cm 
angular spectra will be increased by ~ N times. In this 
way we can obtain an accurate shape of the foreground 
power spectrum. We define T avg (u,v) as 



1 N 
T avg (u,v) = —^2t^(u,v). 



(1) 
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What, then, does T avg (u,v) mean? If T v (u, v) varies 
slowly as a function of v, we can show that the ob- 
tained foreground power spectrum is a good represen- 
tation for the power spectrum at the central channel 
(y = V(n+i)/2)- Suppose that N is an odd number. Then 
we can write 



Tl or iu, v) = T/ or , (u, v) + aAu, 



(2) 



where a is a constant, Av — v t — V{n+\)]2 and the 
superscript 'for' denotes the foreground. We assume 
\aAv\ <§; \Tl° r \ and we ignore the second-order quanti- 
ties. From Eqs. ([T]) and ([2]), we have jj Yj%=i Tvi(u,v) « 
Tl OT (u.v), which results in 

"(JV + 1)/2 V ' <" 



fj av 9 
~iVi ,for 



c[ 



Iy (JV + l)/2> for 



(3) 



where C"" 9 and Cp' Jm are the angular power spectra of 
T aV g(u,v) and T/ or (u,w), respectively. 

The foreground power spectrum (w C™ 9 ), as well 
as the foreground map (w T avg (u,v)), can be used for 
other observations. For example, when a future low- 
noise single channel observation becomes available at 
v = l/ (N+i)/2, we can use C"" 9 or T avg {u,v) to subtract 
the foreground. It is also possible to use them for future 
high angu lar re solution low-noise observations. Cho & 
Lazarian (J2010D argued that when accurate foreground 
observations are available for a certain range of Z's, we 
can extrapolate the foreground spectrum to a range of I 
where no foreground information is available. Therefore, 
if a very high angular resolution single-frequency obser- 
vation is performed in the future, we can extrapolate the 
foreground spectrum (i.e. C™ 9 ) to higher values of I and 
extract the 21-cm information on the small scales. 

2.3. Proposed techniques 
Case 1: same power spectrum shape for all 
channels — In this case, we can easily obtain C"*' cm . 
Note that, at this moment, we know only the shape (the 

/-dependence) of C\ ui ° r . The amplitude can be differ- 
ent in each frequency channel and we do not know it 
yet. We can determine the amplitude by fitting or by 
the following method. Suppose that the value of 



5>og(CH-log(aCr)| 2 
i 

has a minimum value at a — c^. Then we have 
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Case 2: linear dependence of T(u,v) on v — In 

this case, as we discussed in the previous subsection, we 



have C"" 9 
is 



q (w+i)/2,/or ^ rpj^ 2i_ cm spectrum at v = V{ 
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From Eq. @, we have 



Vijor ,^(iv + l)/2i/ or 



(~,Vi,jor rt 



AAv, 



where A = Re[2J2 l/27r T^ 



,(u,v)a* 



Here 



(6) 



(7) 
de- 



for 
jl/2-n J "i'(jv + l)/2 v 

notes the complex conjugate and Re[...} stands for the 



real part. Therefore, the average spectrum of the 21-cm 
signal over N channels is 



1 N 1 / N \ 

i=l \i=l / 



riavg 



(8) 



Case 3: quadratic dependence of T(u,v) on v-hv 
this more general case, we can obtain the average 21-cm 
spectrum. Suppose that N—even for simplicity. Then 
we can write 



Tl or (u, v) = f + aAv + bAv 2 , 



(9) 



where fo is a visibility at vq — {vn/2 + v i+n/2)I^ ( n °t 
observed), a and b are constants, Av = vi — v§ = [i — 
(N + l)/2]Sis, and 8v = z^ + i — Vi. We note that 



\T avg (u,v)\ 2 = 



l/o| 2 



/o + 



2N 



N/2 

Avl^j-if 

i=i 

N/2 



N 



-Avl 



(2j- l) 2 Re[f b*]. (10) 



.7 = 1 



On the other hand, we have 



N/2 

E 



\T„ z (u,v) + T„ N _ z+1 (u,v)\ 2 (11) 
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2f + -(2j-l) 2 Sv 2 
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where we dropped the cross-correlation terms between 
the 21-cm and the foreground signals (see Q. If we 
subtract Eq. (flU|) times 2N from Eq. (TTTj) , we have 

Y^iLi \Tvl cm ?- Therefore, the average spectrum of the 
21-cm signal is 



, N N/2 



(12) 



where DY* is the angular spectrum of T Vi (u, v) + 

3. DISCUSSION 

Our technique critically depends on the assumption of 
slow variation of the T/ or (u, v) over frequency channels. 
This assumption may need further investigation. How- 
ever, we believe that this is a good assumption due to 
the following reasons. First of all, we know that the 
major foreground, which is the synchrotron foreground, 
is caused by MHD turbul ence (see the statistical de - 
scription of fluctuations in Laz arian &: Po gosvan 2012). 
The variations with frequency from an elementary syn- 
chrotron emitting volume are given by ex no-B 2 ^ 1- "'' 2 

4 More precisely, the right hand side is multiplied by N/(N — 2). 
Similarly, the right hand side of Eq. (01 is multiplied by N/(N — 1). 
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where B± is the magnetic held strength perpendicular to 
the line of sight and the number density of cosmic rays 
has a power law distribution n(E)dE — rioE~ a dE ~ 
rioE~ 3 dE. Therefore we expect the spectral components 
to change as oc (v/vq) . This dependence can be com- 
pensated over the frequency interval of Av — (N — l)8v 
and the residual emission, e.g. arising from other sub- 
dominant components of the foreground, may be treated 
with the suggested technique. The dependence of the 
other foreground compone nts, e.g. free-free, dust and 
spinning dust ( see lLazarian fc Finkbeinerj 120031 for a re- 
view) on the frequency is weak and therefore the lin- 
ear/quadratic expansion in frequencies in Eq. ([I])/© 
should be also valid. If we observe, for example, 610 
MHz, the 21-cm signal is believed to be deco rrelate d if 
A^ > 0.5 MHz (see for example Ghosh et al. 120111 ). If 
we use 12 channels, we have A1//1/0 = 5.5/610 w 0.01. 
Therefore, any third or higher order effects should be 
very small. For synchrotron, since {v/vq) -1 — J2^Lo(^-~ 
v/vo) n , the ratio of the third to the second term of the 
expansion is ~ 0(v /vq — 1) ~ 0.01. For free-free the fre- 
quency dependence is weaker and truncation error will 
be smaller. 

In this paper we assume that the cross-correlation be- 
tween the foreground and the 21-cm signals is negligible. 
This needs further verification. In general, we can write 

Ci = C( or + Cf lcm + Cf c , where 



C? c = Re 



N» 



zZ Tfo 

1/27T 



{u,v)T (u,v) 



(13) 



is the cross-correlation term. When the 21-cm signal is 
7 times smaller than the foreground one (i.e. T 21cm ~ 
T^ or /j), the cross-correlation term Cf c can be larger 
than Cf lcm if the number of modes used for the sum- 
mation in Eq. ([TBI is less than ~ 7 2 . 

However, if we use Eq. (|12p or (JSJ), we may circumvent 
this difficulty. Let us consider the more general case of 
Eq. (fT2j). The cross-correlation term that will appear 
additionally in the right-hand side of Eq. (IT21 is 



C; cc = Re 



N> 



■Ec^-b) 

l/2ir 



N/2 



A=j f J2[(2fo + 2bAv 2 )(T< 

8=1 

2 

TV 2 



21cm 1 rp21cm > 



B 



V/o + bAv* 




\ A rp21cm* 



.(14) 

^(15) 
,(16) 



\A-B\<2\b\Avi ax \T^ m \HN,(l7) 



where the variables are defined similarly (i.e. Av — 
vi — vq, N=even,...), 'A' and 'B' are from the first and 
the second terms on the right-hand side of Eq. (|12l) , re- 
spectively, Av max — un — vq, and we dropped '(u,v)' for 
simplicity. Note that the terms containing /o cancel out. 
Therefore, if 
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Jinn's. 



<\T: 
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correlation term is indeed larger than the 21-cm spec- 
trum Cf lcm for any reason, our technique can reveal the 
cross-correlation spectrum. Even in this case, it may not 
be difficult to infer the shape of Cf lcm from the shape of 
Cf. 

Our technique is diffe rent from standard polynomial- 
fittin g approaches (e.g., iMcQuinn et al.l [20061 : iLiu et all 
2009). First, although we assume a polynomial depen- 
dence of the foregrounds on frequency, we do not need 
the actual fitting stage. Therefore our technique can be 
used as a synergistic technique to previous standard sug- 
gestions. Second, since Ci oc J2 I^L we expect that the 
high-order frequency dependence of T may cancel out 
during the summation. Therefore, in the power spec- 
trum we may have a smoother signal which is much eas- 
ier to constrain. Third, our technique can give a use- 
ful insight for removing noise. Suppose that noise sig- 
nals are larger than or comparable to the 21-cm signals 
and that the 21-cm signals at v and v' are correlated if 
\v — v'\ < Av = i/j_|_i — Vi. In this case our technique will 
give an average spectrum of the noise and the 21-cm sig- 
nals, ~ C? mse + Cf lcm . Now, we add Af-1 new channels 
between each v% and Vi+i (i = 1, ..., N), so that there are 
a total of NM channels. Here we assume that the total 
bandwidth stays roughly the same and the bandwidth 
of each channel is now ~ M times narrower than be- 
fore. Then we apply our technique again. Note that the 
21-cm signals are correlated in adjacent channels, while 
the noise signals are not. The newly obtained average 



we can neglect the cross-correlation term. If the cross- 



spectrum is now proportional to ~ d imse + Cf lcm /\/M. 
Therefore, by comparing the two results, we may extract 
the spectrum of the 21-cm signals. For example, if the 
noise and the 21-cm signals have different dependence on 
I (or k), we can apply the technique in Cho & Lazarian 
(2010). 

4. SUMMARY 

We have described a new technique to remove the fore- 
ground from redshifted 21-cm observations. We have as- 
sumed that the foreground signals change slowly as the 
frequency changes: we have assumed up to a quadratic 
dependence of T/. or (u,w) on frequency. We obtain the 
foreground spectrum by simply adding all the observed 
T Vi (u, v). If the observations are done in real space, we 
add all the T Vi (9 Xl 9 y ) in real space first and then we per- 
form the Fourier transform. When we add T Ui (u, v) of all 
channels, the foreground spectrum goes up by a factor of 
~ N 2 because they are highly correlated. However, the 
21-cm spectrum goes up by a factor of ~ N because the 
signals in different channels contribute randomly. This 
way, we can obtain an accurate shape of the foreground 
power spectrum. Then, we obtain the 21-cm power spec- 
trum by subtracting the foreground power spectrum ob- 
tained this way. We have described how to obtain the 
average 21-cm spectrum (Eqs. [8] and [12]). We have 
derived the condition for neglecting the cross-correlation 
term (Eq. [IS]). 
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